Search for: All records

Creators/Authors contains: "Tseng, George C."

« Prev Next »

Total Resources

3

Resource Type
Conference Paper

0

Conference Proceeding

0

Dataset

0

Journal Article

3

Workshop Report

0

Availability
Full Text / Resource Available

3

Citation Only

0

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

High-dimension to high-dimension screening for detecting genome-wide epigenetic and noncoding RNA regulators of gene expression

https://doi.org/10.1093/bioinformatics/btac518

Ke, Hongjie ; Ren, Zhao ; Qi, Jianfei ; Chen, Shuo ; Tseng, George C. ; Ye, Zhenyao ; Ma, Tianzhou ; Alkan, ed., Can ( July 2022 , Bioinformatics)

Abstract Motivation
The advancement of high-throughput technology characterizes a wide variety of epigenetic modifications and noncoding RNAs across the genome involved in disease pathogenesis via regulating gene expression. The high dimensionality of both epigenetic/noncoding RNA and gene expression data make it challenging to identify the important regulators of genes. Conducting univariate test for each possible regulator–gene pair is subject to serious multiple comparison burden, and direct application of regularization methods to select regulator–gene pairs is computationally infeasible. Applying fast screening to reduce dimension first before regularization is more efficient and stable than applying regularization methods alone.
Results
We propose a novel screening method based on robust partial correlation to detect epigenetic and noncoding RNA regulators of gene expression over the whole genome, a problem that includes both high-dimensional predictors and high-dimensional responses. Compared to existing screening methods, our method is conceptually innovative that it reduces the dimension of both predictor and response, and screens at both node (regulators or genes) and edge (regulator–gene pairs) levels. We develop data-driven procedures to determine the conditional sets and the optimal screening threshold, and implement a fast iterative algorithm. Simulations and applications to long noncoding RNA and microRNA regulation in Kidney cancer and DNA methylation regulation in Glioblastoma Multiforme illustrate the validity and advantage of our method.
Availability and implementation
The R package, related source codes and real datasets used in this article are provided at https://github.com/kehongjie/rPCor.
Supplementary information
Supplementary data are available at Bioinformatics online.

more » « less
Unpaired data empowers association tests

https://doi.org/10.1093/bioinformatics/btaa886

Gong, Mingming ; Liu, Peng ; Sciurba, Frank C ; Stojanov, Petar ; Tao, Dacheng ; Tseng, George C ; Zhang, Kun ; Batmanghelich, Kayhan ( October 2020 , Bioinformatics)
Alfonso, Valencia (Ed.)
Abstract Motivation There is growing interest in the biomedical research community to incorporate retrospective data, available in healthcare systems, to shed light on associations between different biomarkers. Understanding the association between various types of biomedical data, such as genetic, blood biomarkers, imaging, etc. can provide a holistic understanding of human diseases. To formally test a hypothesized association between two types of data in Electronic Health Records (EHRs), one requires a substantial sample size with both data modalities to achieve a reasonable power. Current association test methods only allow using data from individuals who have both data modalities. Hence, researchers cannot take advantage of much larger EHR samples that includes individuals with at least one of the data types, which limits the power of the association test. Results We present a new method called the Semi-paired Association Test (SAT) that makes use of both paired and unpaired data. In contrast to classical approaches, incorporating unpaired data allows SAT to produce better control of false discovery and to improve the power of the association test. We study the properties of the new test theoretically and empirically, through a series of simulations and by applying our method on real studies in the context of Chronic Obstructive Pulmonary Disease. We are able to identify an association between the high-dimensional characterization of Computed Tomography chest images and several blood biomarkers as well as the expression of dozens of genes involved in the immune system. Availability and implementation Code is available on https://github.com/batmanlab/Semi-paired-Association-Test. Supplementary information Supplementary data are available at Bioinformatics online.
more » « less
Full Text Available
Variable screening with multiple studies

https://doi.org/10.5705/ss.202017.0439

Ma, Tianzhou ; Ren, Zhao ; Tseng, George C. ( January 2020 , Statistica Sinica)

Full Text Available